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Abstract —The polarization process of polar codes over a 
ternary alphabet is studied. Recently it has been shown that the 
scaling of the blocklength of polar codes with prime alphabet size 
scales polynomially with respect to the inverse of the gap between 
code rate and channel capacity. However, except for the binary 
case, the degree of the polynomial in the bound is extremely large. 
In this work, it is shown that a much lower degree polynomial 
can be computed numerically for the ternary case. Similar results 
are conjectured for the general case of prime alphabet size. 


We can rewrite © as I(W) = ]> ~2 G W (G) G, where 

W(G)± J2 W M ( 4 ) 

y:H[v(y)]=l-G 

A basic polarization transformation of a channel W forms two 
channels, W~ = W ffl W and W + = W ® W. Recall that 
given two channels, W a and W b , W a ® b = W a ffl W b is defined 
by 
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I. Introduction 

Polar codes for transmission over binary discrete memo¬ 
ryless channels (DMCs) were introduced by Arikan JTj, and 
were further analyzed in 0. These results were extended to 
q-ary polarization for an arbitrary prime q in 0-0. 

For the binary case it was shown that the blocklength 
required to transmit reliably scales polynomially with respect 
to the inverse of the gap between code rate and channel 
capacity 0-0. This result was recently extended to <y-ary 
channels for an arbitrary prime q O but in the new bound, 
the degree of this polynomial is extremely large. 

In this paper we obtain numerically a much better bound 
for q = 3. For that purpose we obtain numerically a lower 
bound on the size of a basic polarization step which is higher 
than the one for the binary case. We conjecture similar results 
for any prime value of the alphabet size, q. 

II. Preliminaries 
A. General definitions and results 

We follow the notations of 0 Lemma 5]. For the g-ary 
channel W ( y \ x ), we define W(y) = (1 /q) o W (y\x) 
and the vector v(y) = [v 0 (y),v 1 (y), u g _i(t/)] T where 

Vz g {0,1 ,..., q - 1} : v x (y) = j ^ ■ (1) 

qW(y) 

Note that v x{y) = 1 and the symmetric capacity is 

i(w) = J2 w (y){ 1 - H Wy)}} ( 2 ) 

v 

where 

9-1 

h [v(t/)] = - v * ( y ) iog. My) ■ (3) 

x=0 


l ?_1 

W a ®b (yi, Vi | u) = - W b ( 2/2 | u') W a {yi\u + v!) 

^ u'=0 

Hence W aBb (yi,y 2 ) = W a (yi)W b (y 2 ) and 0 Proof of 
Lemma 6] 


9-1 

Va®b,u (yi, 2 / 2 ) = X Vb ’ u ' (y2) V a,u+u' (Vl) 

u'—O 

which can be rewritten as 


Vafflb (l/l, 2/2) = V b ( 2 / 2 ) *V a (j/i) (5) 

where * denotes circular cross-correlation with period q. 
Defining 

9 (Gi, G 2 ) = 1 - min H [v b (y 2 ) * v a ( 2 / 1 )] (6) 

ff[Va(l/l)l = l-Gl 
H[vb(y2)]=l-G 2 

we obtain 

I(W aSb ) = ^2 W a@b (yi,y 2 ){l-H[v a@b (y ll y 2 )}} 

3/1»2/2 

< E X W a (y 1 )W b (V2)9(G 1 ,G 2 ) 

Gi,G 2 yi:H[v a (yi)]=l — Gi 
V2-H[v b (y 2 )]=l-G 2 

= W a {G 1 )W b (G 2 )g(G 1 ,G 2 ) 

g u g 2 

where the first equality is an application of 0, the inequality 
follows from 0. © and W aSb {yi,y 2 ) = W a {yi)W b {y 2 ), 
and © yields the last equality. If g (G 1 , G 2 ) is concave in G± 
and separately, not necessarily jointly, in G 2 


I {Wash) < 9 


Y J W a {.G l )G 1 ,Y J W a {G 2 )G 2 

_ G\ G 2 


= g[I{Wa),I{W b )] (7) 


and since W~ = W mW, I (W~) < g[I(W), I{W)]. If 
g (G 1 , G 2 ) is not concave in G\ and in G 2 , we can replace it 
with a concave upper-bound, and © will remain true. 





Note that by ©, v aB b,u (j/i, 2 / 2 ) = v bBa - u ( 2 / 2 , yi), where 
the subtraction is modulo q. Combining this with <l6]) yields 
9 (Gi, G 2 ) = g (G2, Gi). 

/i. Proved results about the QSC channel 

A q- ary symmetric channel (QSC) W (y | x) with error 
probability p is defined by 

W(y\x) = l l 7, P n y 7 X 

' ’ \ p/(q- 1) y ± x . 

Although the QSC channel does not maximize © for some 
pair (Gi,G 2 ), we observed that for q = 3 it provides an 
excellent approximation to the maximum, and we conjecture 
that this holds true for any prime q. 

Lemma 1. If W a and W b are QSC channels, then W aBb 
is a QSC channel as well. Furthermore, I (W aBb ) = 
g Q SC [I(W a ),I(W b )]for 


Since the constraints in this problem form a convex region, 
and by Lemma [3 we minimize a concave function, /(u), 
the result is obtained on the boundary of the convex region, 
and g = g. Note that Lemma [T] enables us to compute g 
efficiently using known algorithms for concave minimization 
over a convex region m. This algorithm generates linear 
programs whose solutions minimize the convex envelope of the 
original function over successively tighter polytopes enclosing 
the feasible region. As the polytopes become more complex 
and more tight, the generated solution becomes more precise. 

We can now prove the following. 

Lemma 4 . g (G-|, G 2 ) has the following properties: 

1) g(xi,yi) < g{x 2 ,y 2 ) for x\ < x 2 and yi < y 2 . 

2) g (1, G 2 ) = G 2 

3) <7 (Gi,G 2 ) < min(Gi, G 2 ). 

4) linw = 0 


g Q SG (Gt, G 2 ) = 1 -h q h- 1 (1 - GQ + hf 1 (1 - G 2 ) 

h-^l-GQh-^Q-Gi) ( 8 ) 

with h q (p) = - (1 - p) log ? (l - p) -p\og q {jTi) and h f l 


is the inverse of h q , that yields values in 


0, 


q-l 


The proof of this Lemma is a straightforward application 
of © and ©. 

Lemma 2. Using QSC channels W a and W b yields an extreme 
point in the Lagrangian related to © for Gi,G 2 > 0. 


The proof of this Lemma is also straightforward. 


Proof: Since X\ < x 2 and y\ < y 2 , the constraints 
for g(xi,yi) are tighter than the constraints for g(x 2 ,y 2 ). 
Since it is a maximization problem (1 — min), the maxi¬ 
mum for (xi,yi) would be smaller than the maximum for 
(x 2 ,y 2 ), i.e. < 7 (xi, 2 /i) < g{x 2 ,y 2 ). Since g = g, statement 
Q] follows. Statement [2] follows since for Gi = 1, v a (y\) 
is a circular permutation of [1,0,..., 0] T , so by © and 
©, H[v aSb {y 1 ,y 2 )] = H[v b (y 2 )\ Now, p(Gi,G 2 ) < 
g (Gi, 1) = Gi and p(Gi,G 2 ) < 3(1,G 2 ) = G 2 , which 
yields statement [3] Since © is a maximization problem, 
Lemma H yields that g(x,G 2 ) > gQsc (x,G 2 ), where g QS c 
is defined in ®. By parts 1) and 2), g(x,G 2 ) < p(l,G 2 ) = 
G 2 = gQsc(f,G 2 ). Also, straightforward calculations show 
that lim^i a905 g^ x ’ G ' 2 ^ = 0. Combining the above yields 
statement [4] ■ 


III. Analysis and Numerical Results 
Observe the similar to © problem 
g (Gi, G 2 ) = 1 - min H [v 6 (y 2 ) * v a ( 1 / 1 )] 

H[Va(yi)]>l — Gl 

HW h {yf)\>l-G 2 

First, we prove the following. 

Lemma 3. Define / (u) = minjj ( v )> 1 _ ( 5 H (u * v). Then, 
f (u) is concave. 

Proof: By definition, / (uo) = min h(v)>i-g H (uo *v) 
and / (ui) = H (ui *v). Then 

/ (mr 0 + (1 - a)ui) 

= min H (aun * v + (1 — a)ui * v) 

H(v)>l — G 

> min \aH (un * v) + (1 — a)H (ui * v)l 
H(v)>l-G 

>a min H (uo * v) + (1 — a) min fJ(ui*v) 
H(v)>l-G H(v)>l-G 

= af (u 0 ) + (1 - a)f (ui) 

where the first inequality follows from concavity of H, and 
the added degree of freedom to the minimization yields the 
second inequality. ■ 


Next, we calculate g{G\,G 2 ) for Gi,G 2 ~ 0 and 
for Gi,G 2 « 1. To simplify the notation, we will de¬ 
note V a (yi) = V a = K, 0 ^ 0 , 1 , • • • ,V a , q -l] T , V 6 (y 2 ) = 
Vb = [vb,o,Vb,i,...,Vb t q-i] and v amb {y u y 2 ) = v t = 

[^£, 0 : tit, 1) ■ ■ • ; Xt,q—l] ■ 

Lemma 5. For sufficiently small values of G\ and G 2 and 
q = 3, g (Gi,G 2 ) = ln3 ■ G\G 2 . 

Proof: Consider ©. For G 2 sufficiently small, v b y = 
1 /q + Ci where Ci are sufficiently small and e * = 

Using Taylor’s approximation, and 7 = q/(2\nq), H [v b ] = 
1 — 7 e f ■ s hM first solve the minimization problem 

in © for a fixed v a and G 2 ~ 0, so v t ,i = 1/q + 

Efc =0 e kV a ,i+k and H [v t ] = 1 - 7 Ei=o (Efc=o e kV a ,i+k) ■ 

Hence, 5 (Gi,G 2 ) = 7 max Jf'Cu (l>2lZo e/AVz+fc) = 

7 maxe T Ae s.t. e T e = G 2 / r ) and E?=o = 0- Here 

e = [e 0 ,..., e g -i] i and A = Ei=o v o 4 v a,i where v Qji is 
a cyclic shift by i of v a . Hence, 

q-l 

g (Gi, G 2 ) = G 2 maxe T Ae s.t. e T e = 1 , ^ a = 0 . (9) 

2=0 









Note that A is a circulant matrix, and for q = 3 


di,j 


E 2 2 • 

k—0 V a,k * = 3 

Sfc=0 v a,kV a ,k+1 i 7^ j 


so A has only two eigenvalues: Ai = 1 and A 2 = 5Jk=o v ak~ 
J2i=o v a,k v a,k+i < Ai. The eigenvector associated with Ai 
is u x = c[l,l,l] T so the linear constraint can be expressed 
as e T Ui = 0. Following ifTTl page 411, Th. 7], the solution 
to © is Gr 2 ^ 2 - The eigenvector associated with A 2 is e = 
c [1, —0.5, — 0.5] T , making Wb a QSC channel. Substituting it 
into © yields 


/ 2 2 n 

g (G U G 2 ) = G 2 f £ <i - E V a,iVa,i+l 
\ 2=0 2=0 / 


( 10 ) 


For G\ « 0, v a , t = 1/3 + S t , S t = 0 and J2i=o v l,i ~ 

Xli=0 v a,iVa,i +1 = X)i=0 ~ 0 ‘Wi+l- 

Since ^i=o = °> X)i=o ^ 2 = 2 (^1 + S\ + <^ 1 ^ 2 ) and 
Y?i= o ^A+i = - (^1 + ^2 + <^ 2 ) ■ Therefore, ^Lo J 2 - 
Z)i= 0 M*+i = !-5 Ei=o Combining this with ® 

yields the stated result. ■ 


Lemma 6. For G\ and G 2 sufficiently close to 1, and q = 3, 

< ? (G 1 ,G 2 ) = G 1 + G 2 -1 


Proof: Consider (O. For G\ sufficiently close to 1, we 
can assume without loss of generality that v a j = S t , i = 
1,... ,q — 1, where <5* are small, and u aj o = 1 — SiLi 
Similarly, for G 2 sufficiently close to 1, we can assume without 
loss of generality that Vb.i = e,, i = 1,..., q — 1, where e, are 
small, and = 1 - Y%Zi e i • Now , 1 — Gi = H [v a ] = 

- Ei= 1 ^ lo g 9 s i and 1 -G 2 =H [v 6 ] = - Y^Zi e * lo Sg e t- 
For Gi and G 2 sufficiently close to 1, v t « [1 — <5i — S 2 — 
er - e 2 , Si + e 2 + eiS 2 ,S 2 + ei + e2<5i] T . Hence, H [v t ] = 
—(Si+e 2 +eiS 2 ) \og 3 (5i+e 2 +ei5 2 ) — (5 2 +ei+e 2 5i) log 3 (i52 + 
e 1 + £ 2 ^ 1 )- Now, our main observation is that for a, b, c small, 

— (a+b+c) log(a+6+c) ~ — a log a —b log b— clog c. Applying 
this observation and —e<5 log(eA) << —eloge — A log <5 for 
small e and S yields H [v t ] ss 2 — Gi~G 2 so thatp(Gi,G 2 ) ~ 

Gi + G 2 - 1. ■ 



Fig. 1. Numerically calculated g (G i, G 2 ) for q = 3 
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Fig. 2. Numerically calculated 99 ^ < qq^ 2 ^ for q = 3 

for each G 2 £ (0,1) there exists x * s.t. d is positive 

for x < x* and negative for x > x*. 



Note that the same proof applies for a general q. 


We calculated the actual value of g numerically. We 
calculated g (O.Oln, 0.01m) for q = 3, n = 1,2,..., 99 and 
m = 1,2,..., 99. In Figure [j] we plot the contour of this 
function. This figure shows that g{G\,G 2 ) = g(G 2 ,Gi) as 
noted above, and, as proved in Lemma |4j g (1, G 2 ) = G 2 . 


Plotting the numeric d9 ^Q^ 2 ' > in Figure [2] shows that 
g(Gi,G 2 ) is increasing in G\ (and by symmetry, in G 2 ), 
as proved in Lemma [4] Next, using the calculated points, 
we estimate 9 ■ This estimated second derivative is 

shown in Figure [3] suggesting the following conjecture (since 
the bottom line represents — 9 ^pi G2) = 0, so below it 

9 9{ 'dG‘^ G ^ > > 0 and 9 9 ^g'^ G2 ' > < 0 above that line): 


Property 1. g (G] ,G 2 ) is concave in G \ (and by symmetry, 
in G 2 ), except for small values of G\ and G 2 . In other words, 


Therefore, the convex hull of g (Gi, G 2 ) for a given G 2 is 


Gi 

max — 
xG[Gi,1] X 


g (x, G 2 ) = < G j 


%^9{G\,G 2 ) 


9{Gi,G 2 ) 


G i < G\ 
Gi > G\ 


where G\ = argmax xg [ 0 g ( x ' G A Finding G\ is equivalent 

. i • dg(x,G2) 9 ( x ->G2) 4 . d 2 g(x,G2) ^ n • -c a’ 

to solving x z> s.t. —< 0, i.e. finding a 

tangent to g (x, G 2 ) at x s.t. 9 9 g X ’^ 2 ^ < 0, that passes through 

( 0 , 0 ). 

Lemma 7. If Property [7] holds, the problem x ■ da = 
g (x , G 2 ) s.t. 9 9 g X ^ 2 ^ < 0 has a single solution. 


The proof of this Lemma follows from analysis of x ■ 

^^- 9 (x,G 2 ). 

However, we want an upper bound on g{G\ 1 G 2 ) that 
would be concave in Gi and G 2 . Similarly to the case of 
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Fig. 3. Numerically calculated 9 f° r 9 


= 3 


Fig. 5. Numerically calculated ei(I(W)) — e*(/(FF)) for q = 3 
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Numerically calculated 9 9 for q = 3 


Fig. 6. The lower bound on I (FF + ) — I (FF), which is also a lower bound 
on I (FF) — I (W ~), for different values of q, and for the QSC channel 


fixed G 2 , 

g* (Gy,G 2 ) = max ——-g(x 1 ,x 2 ) (11) 

xi£[Gi,1] X\X2 
®2G[G2,1] 

Clearly, g* (Gi,G 2 ) > <?(Gi,G 2 ) and Figure Q] shows that 
g* (Gi, G 2 ) is concave in G\ and in G 2 (the lines at the bottom 
of the figure stand for the area where 9 9 ^q'I G ^ = 0 . 

Proposition 1. There exists ef(x) s.t. I (W~) + ef \I{W) 1 < 

i(w)<i(w+)-q [i(w )]. 

Proof: Set e*{x) = x — g*(x,x), where g*(x,x) was 
defined in (fill . Recalling that I (W~) < g* [I{W),I{W)\ 
and I ( W~ ) + I ( W + ) = 2 1(W) yields the stated result. ■ 

The minimal polarization step size is e*(ai) rather than 
ei(x) = x — g(x,x). However, ei(x) — e*(x) is very small, 
as seen in Figure 0 so we can use ei(x), which is easier to 
calculate. In Figure [6] we plot ef x) for different values of 
q, and see that for q = 3, e/( x) is close, but not equal to 


£i,QSc(x) = x + h q j/i 9 1 (1 - x) 2 - -^h q 1 (1 - x) j-1 
which is marked as “q = 3 QSC”. From Lemma [5] ti(x) ~ 
a;—In 3-a; 2 for x —> 0, so lim^^o = 1> as seen i n Figure 
[6] Lemma [6] yields ei(x) « 1 — x for x —> 1, as can be seen 
in Figure [6] Note that for q = 2, we would obtain the same 
ei(x) = e*(x) = ei,QSc(x) as in Q. 


Given some function fo(x), defined over [0,1] s.t. fo{x) > 
0 for a; € (0, 1), and /o(0) = /o(l) = 0, we define fk{x) for 
k = 1,2,... recursively as follows, 


fk( x) = sup 

ei(x)<e<e h (x) 


fk-l{x + e) + fk-l{x — e) 


where efx) = x — g{ x, x) and e^(x) = min(a;, 1 — x). 

Define L k (x) = and L k = sup ze ^ 0> ^L k (z). With 
the definition of f k (x), < L\ still holds as in (H). Simi- 




































IV. Future Research 



Fig. 7. A plot of log Lk(x) for k = 1 and k = 100, q = 3 and 
/o(ir) = (0.26x 2 + l) rr 0 ' 8 (l — i) 0 6 . The functions L^{x) were calculated 
numerically. 


larly to J8] Equation (11)] we have, for an integer 0 < k < n, 

E [f Q (/„)] < ■ ( VZ I)" ■ fo [I(W )] . (12) 

Similarly to GO we define J n = min(/„,l — I n ). Using 
fo{z) = (0.26x 2 + l) a;°- 8 (l — x) 0 - 6 similarly to 0 Lemma 

3] , we obtain P (J n > S) < ^ • 2 _01817ra . As can be seen 
in Figure [7] numerical calculations yield L i = 2 -0161 and, 
10 \/Liqo = 2 -01817 . A plot of i log 2 Lk as a function 

of k for q = 3 and fo(z) = (0.26x 2 + l) x°' 8 (l — x) 0 ' 6 
shows a convex decreasing function, similar to 0 Fig. 3], 
suggesting that it is reasonable to expect that for this particular 
fo(z), using k = 100 is already a good choice for (ITU 
(i.e., we cannot improve much by using an higher value of 
k). Similarly to 0 Lemma 4] we have the following. If 
P [lo € U : iri(w) ^ (<5,1 — <5) Vn > mo] > 1 — e for some 
integer mo, 0 < e < 1 and 5 < 1/3. Then 

P (w £ fl : I n (w) > 1 — S Vn > mo) > I(W) — e 
P (uj € Q : I n (u>) < 5 \/n> mo) > 1 — I(W) — e . 

The proof is essentially the same as the proof of 0 Lemma 

4] , with I n replacing 1 — Z„. Finally, we can obtain a 

result similar to |8] Theorem 1]. We use essentially the same 
proof but with the following modification. First we obtain a 
result similar to 0 Equation (25)] using the same approach: 
P (ui € O : /„(<*;) > 1 — S Vn > mo) > I(W) — • 

i_ 2 -p ■ Then we combine it with [II] Equation (2)] to obtain, 

P(u e n : Z n (oj) < C Vn > mo) > I(W) - (f*) • 
and proceed with the derivation in 0 Theorem 1 ], Since 
p = 0.1817, 1 -F 1/p = 6.504, we claim the following result 

Proposition 2. Suppose that we wish to use a polar code 
with rate R and blocklength N to transmit over a binary-input 
channel, W, with block error probability at most P®. Then it 
is sufficient to set N = tsot (or larger) where /3 is 

a constant that depends only on P®. 


In this paper we showed numerically that for the case where 
q = 3 we can obtain an improved lower bound on I(W) — 
I(W~) compared to the binary (q = 2 case). Consequently 
we can predict a much better scaling law of the blocklength 
with respect to I(W) — R compared to the results in J9]- It is 
interesting to continue this study for other values of prime q. 
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Appendix: Supplementary Material 
A. Proof of Lemma [7] 

Assume that W a and W b are QSC channels with error 
probabilities p a and p b , respectively. Then, for all y = 
0 ,1 ,..., q — 1, v Q (y) and v 0 ( y ) are circular shifts of v a and 
v b , respectively, where 

V a = [1 - Pa,Pa/(<7 ~ 1 ),Pa/{q ~ 1), ■ • • ,Pa/(<7 ~ 1)^ 

v& = [1 -p b ,Pb/(q-l),p b /{q- i),...,p b /{q-l)] T ■ 

Since for the QSC case, all v(y) vectors are shifts of some v, 
if VC is a QSC channel, /(VC) = 1 — 77 (v). This means 

/ (VC a ) = 1 hq (Pa) (13) 

I(W b ) = l-h q (p b ) . (14) 

Using ©. we see that v affl b (y 4 , y 2 ) are circular shifts 

Of v offl& = [l -pt,Pt/(q- l),Pi/(g- 1),. •. ,Pt/(q- l)] T , 
where 

Pt=Pa+Pb~qPaPb/(q~ 1) (15) 

so 1 V a mb is a QSC channel with error probability p t , and 
I {Warn) — 1 — h q ( p t ). Combined with (fT3k(fl~4T> and (fl5l ). 
this means that for the QSC case, © becomes an equality if 
<?(•, •) is defined as in ©. 


B. Proof of Lemma [2] 

Assume v a (t/i) = [u aj0 , u aj i,... ,v a , q -i} T and v b (y 2 ) = 
[vb,o,v b ,i,...,v bt q- 1 ] T . Using © yields v aBb (y 1 ,y 2 ) = 
[vt,o,vt t i,...,v t , q -i} where 
9-1 

v t .i = y ^v a jVb,j-i for i = 0,1 ,... ,q - 1. (16) 

j=o 

The Lagrangian related to solving the minimization in © is 


L = H [v offlb (yi,y 2 )] - Ai {H [v 0 (j/i)] - 1 + G i} 

- A 2 {H [v h (y 2 )] - 1 + G 2 } - A 3 v a<i - 1^ 


Az-i 


- a 4 y v b ,i -1 


^2=0 


9-1 


= ^ ^ V t ,i log g V t ,i + Al 


1=0 


9-1 


1 + ^2 Va > i l0g 9 Va ’ i ~ Gl 


i=0 


+ A 2 

— A 3 


9~1 

i+y v b ,i iog 9 v bti - g 2 

2=0 


'9-1 


y Ha ,2 -1 


, 2=0 


-a 4 


9-1 


y v b ,i -1 


2=0 


(17) 


and we want to achieve dL/dv a ,i = dL/ dv b ,i = 0 for i = 
0,1,..., q — 1. By ( fl6l >. dv t .jj dv a j = v b j~i and combining 
it with (fTTb and X]i=o v b,i = 1 yields 


dL 

dv a j 


1 

In q 


9-1 

y, v b j-i logg v t ,i + Al 
2=0 




-A 3 =0 Vj e {0, 1} . (18) 


If W a and W b are QSC channels, v a ,i = p a /(q — 1) and 
v b ,i = Pb/(q - 1) for i ± 0, v a ,o = 1 - Pa and u b>0 = 1 - p b . 
By (Q©, v t ,i = pt!{q - 1) for i f 0 and v t p = 1 - Pt, where 
p t is defined in (IT© For j 0, (IT© yields 




+ Ai ( log 

and for j = 0, m yields 


Pb 

q~ 1 

Pa , 1 


log, 


Pt 


q q - 1 In q 


q q- 1 

- A 3 =0 


- r~ ~ (1 - Pb) log„ (1 - Pt) ~ Pb log 

m q H q ~ 1 


+ Ai 


logo (1 - Pa) + 


1 

In q 


- a 3 = 0 . 


Now, if p a ^ 2-^—, i.e. Gi > 0, we have two independent 
equations, so we have a single possible value for Ai and A 3 . 
Combining these equations yields 


Ai = 1 - 


qp b 

g-l 


logo 


Pt 


log„ 


Pa 


As — 


q (q — 1) (1 — Pt) / 09 (<7-l)(l-pa) 

l° g 9 (1 - Pt) + (! - ffi) 1°§9 
l°g 9 - logo (1 - Pa) 

' log, (1-Pa) 

log, f(1 - Pft) log, (1 - Pt) + Pb log, 


logo T^l - logo (1-Pa) 


Ai — 1 

In q 


Similarly, by (IT© dvt,i/ dv b ,j = v a ,j+i and combining it with 
m and o H a ,i = 1 yields 


dL 


9-1 


dv. 


b,j 


= - - y H a ,j + i logo V Li + ( l° g 9 V b,j + - 


2=0 


1 


-A 4 = 0 Vj e {0,1,..., g — 1} . (19) 
If W a and 144 are QSC channels for j f 0, ( IT9l > yields 


1 


Pa 


In q <7 — 1 


log, (1 - Pt) - 1 - 


Pa 

q-l 


log. 


Pt 

q-l 


+ A 2 ( 1 ° g 9^ I + I ^)-A 4 -0 


and for j = 0, (IT9t yields 
1 


- [1 - p a ] log [1 - pt] ~ p a log 


Pt 


In q ^ bq q-l 


+ A 2 


log, (1 ~Pb) + 


In <7 


- A 4 = 0 . 


Now, if p b f ' i- e - G 2 > 0, we have two independent 
equations, so we have a single possible value for A 2 and A 4 . 


































Combining these equations yields 



' l0g « (q - 1K1 - Pt ) / l0g ^ (q - l)(l - p b ) 

gr lQ g 9 (! ~ Pt) + ( x - gr) lp g g ITT 

!ogg - log, (1 - Pb) 

•logg(l-Pb) 

iogg { [1 - Pa] log, [1 - Pt] + Pa logg | 

iogg - logg [1 - Pb] 


Since we have found Ai,..., A 4 that solve dT8l) and (fl9t for 
the case of W„ and 144 being QSC channels, we proved that 
the QSC case yields a critical point in the Lagrangian related 
to © for any value of q. 


C. Properties of gQsc used in the proof of Lemma |7] 

By ®, 


9qsc (1, G 2 ) = 1 - h q 


K 1 (0) + h- 1 (1 - G 2 ) 

q h- 1 (0)h~ 1 (l-G 2 ) 


q~ 1 


= l-h q [hf l (1 - G 2 )\ = g 2 

Straightforward calculations show that 


dgQSC (Gi, G 2 ) 
5 Gi 


log, [(«-l)(i-l)] 

1 - -LrV 
9-f J 

lOgg 

'(9-1) 

|m 

1 

^ >— 1 ^ 



( 20 ) 


where y = 1 (1 — Gi), v = h q 1 (1 — G 2 ) and 2 : = 

y {l — v) + v ^1 — ■ These functions are plotted in Figure 

[U By d20t . lim^-^i dgc i sc ^ G2 '> = 0 (since in this case y = 0 


and 2 ; = v ). 


D. A proof that (x + y) ln(x + y) ~ x In x + y In y for small 
positive x ,y 


We are going to prove that 

„ x In x + y In y „ In 2 

1 < -—i<1- 

(x + y) ln(x + y) ln(x + y) 


so 


lim 

x,y-y 


x In x + y In y 
0 (x + y) ln(x + y) 


= 1 . 


First, since — a; In a; is concave. 



Fig. 8. 


9gQsc(G 1 ,G 2 ) 

dGi 


for q = 3 


so, dividing both sides by — 0.5(x + y) ln(x + y) yields 
x In x + y In y < In 2 

(x + y) ln(x + y) ~ ln(x + y) 

For the other direction we must prove that 

— x In x — y In y > — (x + y) ln(x + y) . 

It is equivalent to 

x [ln(x + y) — lnx] > y [lnp — ln(x + y)\ 

Since In is an increasing function, the left hand side of the 
inequality above is positive, and the right hand side is negative, 
so it is a true statement. 

Note that for q variables (instead of 2) the first half of the 
proof is similar, using q instead of 2, and the second half is 
modified using q— 1 induction steps, one for each sum. 

E. Proof of Lemma \7\ 

Define f(x) = x ■ — 9 (x, G 2 ). We wish to 

prove that f(x) = 0 has exactly one solution that satisfies 
9 < 0- First, f(x) = x ■ 9 g g^ G ^ . Since there exists 

x* s.t. 9 is positive for x < x* and negative for x > x* 

(See Property|T|i. f(x) is increasing for x < x* and decreasing 
for x > x*. Combining this with /(0) = 0 yields that /(x) > 0 
for 0 < x < x*. Lemma Q] shows that lim x _>i = 0 

and g (1, G 2 ) = G 2 , so f(x) = —G 2 . Since / (x*) > 

0, liiiLj;_>i /(x) < 0, and /(x) is decreasing for x* < x < 1, 
/(x) = 0 has exactly one solution for x* < x < 1. The 
only other solution to f(x) = 0 is x = 0, and in this point 


—x In x ~ y In y 


< - 


2 

x + y 


In 


x + y 


ln(x + y) 


x + y 


In 2 


2 















































